What is claimed is: 



1 . A method for indexing feature vector data space 
comprising the step of: 

(a) adaptively approximating feature vectors on the basis of 
statistical distribution of feature vector data in the feature vector data 
5 space. 

2. The method of claim 1 , wherein the step (a) further 
comprises the steps of: 

(a-1) measuring the statistical distribution of the feature vector 
data in the feature vector data space; 
5 (a-2) estimating marginal distribution of the feature vector data 

using the statistical distribution; 

(a-3) dividing the estimated marginal distribution into a plurality 
of grids in which a probability of disposing the feature vector data in 
each grid is uniform; and 
10 (a-4) indexing the feature vector data space using the divided 

grids. 

3. The method of claim 2, further comprising prior to step (a- 
4), the step of updating the grids on the basis of a previous probability 
distribution function and an updated probability distribution function, 
when new data is received. 
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4. The method of claim 2, wherein step (a-4) further 
comprises indexing using vector approximation (VA) files. 

5. The method of claim 2, wherein a number of the plurality 
of grids is determined by a number of bits assigned to the dimension. 

6. The method of claim 2, wherein step (a-2) further 
comprises the steps of: 

(a-2-1) defining a probability distribution function using a 
weighted sum of a predetermined distribution function; and 
5 (a-2-2) obtaining an estimated probability distribution function by 

estimating predetermined parameters using the probability distribution 
function defined in the step (a-2-1). 

7. The method of claim 6, wherein step (a-2-2) further 
comprises obtaining the estimated probability distribution function by 
estimating the predetermined parameters using all N predetermined 
data in each estimation, wherein N is a positive integer, on the basis of 

5 an expectation-maximization algorithm using the probability distribution 
function defined in the step (a-2-1). 

8. The method of claim 6, wherein the predetermined 
distribution function is the Gaussian function. 
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9. The method of claim 6, wherein the probability distribution 
function of step (a-2-1) is a one-dimensional signal, p(x), wherein 



N 

p(x) - X Pi^J^P^J^ ' wherein pix\j) is defined as 



wherein coefficient PQ) is a mixing parameter that satisfies the 

M 

following criterion 0<P(j)<l and 2]^0) = 1- 



1 0. The method of claim 6, wherein the estimated probability 
distribution function of step (a-2-2) is obtained by finding O/, y=1 M, 



which maximizes 0(0,,..., O^^) = nF(v[/]|(0„...,Oj^,)) , where 
parameters v[/], /=1 ,...,A/, is a given data set. 

1 1 . The method of claim 1 0, wherein the estimated 
parameters of step (a-2-2) are updated according to the following 

i:/'(y|v[/])'v[/] 

equations //j*' = -^=^7;; , 

1=1 



20 



1 I 

P(jy^^ = — y]p(j\v[l]y , wherein t is a positive integer 

representing a number of iterations. 

1 2. The method of claim 1 1 , wherein the estimated 
parameter set of step (a-2-2) using N data v[l] is given as 
[p{j)^ ,(aj)^}, and the updated parameter set for new data v[N+1], 
coming in, is calculating using the following equations: 

+C(v[7v+i]-//;), 



POY^' = POY +^—(P(j\v[N + 1]) - POY), and 



puHN+i]y ' 



1 3. The method of claim 1 1 , wherein the step (a-2-2) further 
comprises: 

measuring a change of a probability distribution function which is 



defined as ^ — ^^Poid(.^) — Pnewi^)) ^ for each dimension, wherein a 
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5 previous probability distribution function is Poid{x) , and an updated 

probability distribution function is Pnew{x) \ and 

updating an approximation for the dimension if p is larger than a 
predetermined threshold value. 

14. The method of claim 2, wherein step (a-3) further 
comprises dividing a probability distribution function into the plurality of 
grids to make areas covered by each grid equal, wherein the plurality 
of grids have boundary points defined by c[/], /= 0,...,2^, where b is a 
5 number of bits allocated and wherein the boundary points satisfy a 

criterion, J^[||'^ p{x)dx = ^\"}^^^^ p(x)dx, and wherein the estimated 

A 

probability distribution function is p(x) . 
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